
Kv cache e2e add #1000

Closed
wants to merge 6 commits

Conversation

horheynm
Collaborator

SUMMARY:
"please provide a brief summary"

TEST PLAN:
"please outline how the changes were tested"


👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

kylesayrs and others added 5 commits December 19, 2024 23:16
* remove sparseml utilities

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

* use in model_load

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

* remove use of RECIPE FILE NAME

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

* rename to RECIPE_FILE_NAME, avoid circular import

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

* remove qa ignore

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

* replace tokenizer with processor

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

* defer data collator changes

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

---------

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
Co-authored-by: Dipika Sikka <dipikasikka1@gmail.com>
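The "replace tokenizer with processor" commit suggests call sites were generalized to accept a multi-modal processor where a plain tokenizer was previously required. A minimal, hypothetical sketch of that pattern — the helper name and the `.tokenizer` attribute convention are assumptions for illustration, not llm-compressor's actual API:

```python
# Hypothetical sketch (not llm-compressor's code): normalize either a
# tokenizer or a multi-modal processor down to the tokenizer interface
# when only text handling is needed.

def resolve_tokenizer(processor_or_tokenizer):
    # Multi-modal processors commonly wrap a tokenizer as a `.tokenizer`
    # attribute; plain tokenizers are returned unchanged.
    return getattr(processor_or_tokenizer, "tokenizer", processor_or_tokenizer)

class FakeTokenizer: ...

class FakeProcessor:
    def __init__(self):
        self.tokenizer = FakeTokenizer()

tok = FakeTokenizer()
assert resolve_tokenizer(tok) is tok                               # passthrough
assert isinstance(resolve_tokenizer(FakeProcessor()), FakeTokenizer)  # unwrap
```

Using `getattr` with a fallback keeps existing tokenizer-only callers working while new processor-based callers are unwrapped transparently.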
* fix offload

Signed-off-by: Dipika <dipikasikka1@gmail.com>

* fix smoothquant offload bug

* remove logtime

---------

Signed-off-by: Dipika <dipikasikka1@gmail.com>
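The "rename to RECIPE_FILE_NAME, avoid circular import" commit points at a common Python fix: defer an import into the function that needs it, so two modules can depend on each other at call time without cycling at import time. A hedged, self-contained sketch of the pattern — the constant's value and the path helper are illustrative, not taken from the PR:

```python
# Illustrative sketch (not llm-compressor's code): breaking an import
# cycle by deferring an import until call time. If module A needs a
# constant from module B, and B imports A at module level, importing
# inside the function avoids the cycle at import time.

RECIPE_FILE_NAME = "recipe.yaml"  # assumed value; the commit only names the constant

def recipe_path(model_dir: str) -> str:
    # Deferred import: resolved when the function runs, after all
    # modules have finished loading, so no module-level cycle occurs.
    import os.path
    return os.path.join(model_dir, RECIPE_FILE_NAME)

print(recipe_path("/tmp/model"))  # -> /tmp/model/recipe.yaml
```

For a simple stdlib module like `os.path` the deferral is unnecessary; the technique matters when the imported name lives in a sibling module of the same package.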
Labels: None yet
Projects: None yet
3 participants